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[57] ABSTRACT 

In a computer system having at least two processors, 
each processor having an associated memory, the pro- 
cessors being coupled to one another through an inter- 
face unit by means of a bus, hardware semaphores to 
regulate access to shared resources are disclosed. Each 
semaphore is one bit wide and can be written to obtain 
the desired state. When reading the semaphore, if the 
contents is a one, then a one is returned. If the content 
is zero, a zero is returned but the semaphore is automati- 
cally reset to one. 

4 Claims, 2 Drawing Sheets 




30^ 



04/29/2004, EAST Version: 1.4.1 



U.S. Patent Jan. 4, 1994 sheet 1 of 2 5,276,886 






04/29/2004, EAST Version: 1.4.1 



U.S. Patent 



Jan. 4, 1994 



Sheet 2 of 2 



5,276,886 




04/29/2004, EAST Version: 1.4.1 



5,276,886 

1 2 



HARDWARE SEMAPHORES IN A 
MULTI-PROCESSOR ENV'IROI^ENT 

BACKGROUND OF THE INVENTION 5 

This invention is in the field of muhi-processor digital 
computer systems. In particular, it relates to the syn- 
chronization of communication between a plurality of 
processors. 10 

In the multi-processor environment, various re- 
sources can be accessed by each of the processors. For 
example, input/output devices such as printers and 
CRTs may be used at different times by each and every 
processor. 15 

In certain situations, the use of certain resources, if 
these resources are accessed concurrently by different 
processors, can result in errors due to data coherency. 
For example, if a general purpose register is used as a 
counter and if two different processors read, increment 
and write the register at the same time, the count in the 
register will be incorrect. 

For resources where utilization by multiple proces- 
sors can create data coherency problems, some method 25 
to synchronize and control access to the resource must 
be implemented. Although software "Test and Set" 
flags exist, they are not without drawbacks. In particu- 
lar, as each setting and testing of the flag requires the 
processor to obtain busmastership, a great deal of time is 30 
spent gaining busmastership for a relatively simple task. 
This wasted time can add up quickly in a multi*proces- 
sor environment. A method and apparatus which can 
coordinate access to resources between the multiple 
processors in a minimum amount of time and without 
requiring extensive additional hardware or software is 
therefore desirable. 

SUMMARY OF THE INVENTION 

40 

The present invention facilitates the synchronization 
of access to shared resources between a local and a host 
processor. Hardware semaphores are provided in an 
interface unit between at least two processors. The 
hardware semaphores can be in one of two states— clear 45 
or set. Writing to a semaphore forces it to the enumer- 
ated state. Reading a semaphore returns the current 
state of the semaphore. In addition, if the semaphore is 
"clear", then the read operation changes the state to 
"set" but returns "clear" as the current state. 50 

If the semaphore is "set" (on), then the resource it 
represents cannot be used by the processor attempting 
to use the resource. If the semaphore is "clear", the 
processor requesting access is given that access, but the 
semaphore is simultaneously "set" so no other proces- 
sor can use the resource. As the semaphore exists in 
hardware and as bus mastership is not required to access 
the semaphore, the implementation and operation of 
this type of hardware semaphore is much more efficient ^ 
than known "test and set'* software flags. 

The present invention will now be described in detail 
in conjunction with the figures described below. 

BRIEF DESCRIPTION OF THE DRAWINGS 

65 

FIG. 1 is a block diagram of a typical operating envi- 
ronment for the present invention; and 
FIG. 2 is a block diagram of the present invention. 



DESCRIPTION OF THE SPECIFIC 
EMBODIMENT 

A typical operating environment for the present in- 
vention is shown in FIG. 1. Local Central Processing 
Unit CCPU') 12 is coupled to both local memory 14 and 
interface unit 30. In turn the interface unit is coupled by 
a microchannel architecture ('MCA') bus 17 to host 
CPU 22 and host memory 24. Local CPU 12 can access 
host memory 24 through interface unit 30, and host 
CPU 22 can access local memory 12 in the same man- 
ner. Clock 18 provides tinung signals for the system. 

Interface Unit 

FIG. 2 is a block diagram of interface unit 30. Inter- 
face unit 30 is comprised of six primary functional units: 
local bus interface unit 70, microchannel interface unit 
80, block transfer 90, semaphores and general purpose 
register 100, register arbiter 110, and interrupt control- 
ler 120. 

Local bus interface unit 70 provides the necessary 
support to interface with several different types of 
CPUs, DRAMs/SRAMs, and I/O devices. It can inter- 
face synchronously to 80286. 80386SX, or TMS 
320C25. Local I/O devices are only accessible by the 
local CPU. Local memory 14 can be accessed by the 
local CPU, host CPU, and the interface unit's block 
transfer unit 90. Interface unit 30 supports both byte and 
word transfers. The interface logic includes memory 
controller 72 for DRAMs and SRAMs, bus arbiter 74. 
bus transceivers 76, and control unit 78. Control unit 78 
handles local mapped memory, I/O address detection 
and address translation. It also controls explicit and 
implicit busmastership for gaining and relinquishing 
control of the microchannel architecture bus. In addi- 
tion, it provides logic to keep the local CPU in reset 
state so that the host CPU is able to initialize the inter- 
face imit and the local memory. 

Bus transceivers 76 comprise data and address pipe- 
line-latches, and steering logic to enable the local CPU 
to access local or local mapped memory I/O. The latter 
includes a dead-lock resolution mechanism. Both byte 
and word transfers are supported. The address pipeline 
is 2-levels deep and the data pipelines is one level. The 
first level in the address pipeline is used to latch the 
address. This is necessary, given the pipeline addressing 
nature of the local CPU. The second level of the ad- 
dress pipeline along with the data pipeline is provided 
to boost local mapped memory write throughput. 

Memory controller 72 can support either SRAM or 
DRAM. The SRAM memory can be configured as one 
64K word bank. The low and high bytes can be ac- 
cessed independently. The DRAM memory can be up 
to 16M bytes in size, configured as four banks, with 
256K byte granularity. 256KX1, 256KX4, IMxl, lMx4, 
4Mxl, and 4Mx4 devices must be used. The addresses 
are internally latched and multiplexed onto the DRAM 
address bus, row address followed by column address. 4 
RAS's, one per bank, and 2 CAS's, for odd and even 
addresses are provided. The controller supports page- 
mode access and CAS before RAS refresh. 

Bus arbiter 74 arbitrates among requests for local 
busmastership. Local bus requests can include refresh 
requests, block transfer requests, and host mapped 
memory access. The refresh request has the highest 
priority, followed by host mapped memory access and 
block transfer requests. Upon receiving a bus request, a 
hold request is sent to the local CPU. When the local 
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CPU responds with a hold acknowledge, a refresh re- only by the local CPU. registers writable by both the 

quest acknowledge is generated, if the refresh request is host and local CPU. registers writable by both CPUs 

active. Otherwise, a hold acknowledge is generated in according to the contents of an enable write register 

re^nse to host mapped access. In the absence of the and a group writable only by the host CPU To arbitrate 

first two requests, a block transfer acknowledge is gen- 5 between the two CPUs, the host CPU control signals 

V 1 . , arc synchronized to the local CPU clock. If both CPUs 

MicroChannel interface unit 80 is further comprised try to access interface unit registers simultaneously 

of slave controller 81, master controller 82, address priority is given to the host CPU. The ready line to the 

generator 83, address comparator 84, programmable local CPU is de-asserted until the host CPU has com- 
option select registers 85, dau parity 86, and address, 10 pletcd its access. If a register access is already in 

data and bus control transceivers 87. progress, the CPU trying to access the interface imif s 

Master controller 82 controls the data transfer cycles registers is put into a wait state. The arbiter multiplexes 

when mterfacc unit 30 is the master on the microchan- the local and host addresses and read/write signals onto 

nel bus. In master mode master controller 82 can either the register address bus and the register read/write 

be executing a local mapped memory cycle, a local 15 cycle is performed. 

mapped I/O cycle or performing a block transfer. The Semaphores and General purpose registers 100 corn- 
master controller also mcludes steering logic for per- prise 8 16-bit general purpose registers and 16 scma- 
formmg byte, word and double word accesses on the phorcs to synchronize the communication between the 
microchannel. Double word transfers are supported local and host CPUs. 

only during block transfers in streaming mode. 20 Block transfer 90 incorporates logic for high speed 

Slave controller 81 executes either host mapped transfer of data blocks between the host memory or I/O 

memory cycles or host I/O cycles. For these opera- and the local memory, under the control of the interface 

tions, the interface unit 30 will always appear as a 16 bit unit. 

device. Once a valid address is detected on the micro- Interrupt controller 120 is programmed to generate 

channel, the slave controller is triggered. Complying 25 interrupts to the local and host CPUs upon detection of 

with the microchannel protocol, the slave controller various extraordinary conditions. There are 13 sources 

sends applicable return signals to the host and sends a that can interrupt the local CPU and 9 sources that can 

local bus request to the local arbiter if it is a host interrupt the host CPU. Upon receiving the interrupt, 

mapped memory cycle. the CPUs can read the interrupt Status registers to de- 

Programmable option select registers 85 provide a 30 termine the source of the interrupt. After servicing the 

programmable mechanism to set up configuration data interrupt, the CPUs may selectively clear the interrupt 

for adapters on the microchannel, eliminating the need by writing a *r to the corresponding bit of the interrupt 

for jumpers and switches. These registers are pro- status register, 
grammed by the host processor during initial system set 

up. 35 Hardware Semaphores 

Address comparator 84 and address generator 83 In a preferred embodiment, the present invention has 
together comprise an address processor, used only in 16 hardware semaphores, shown as block 100, to serial- 
slave mode. Comparator 85 is used to detect bus ac- ize access to serially reusable resources or to indicate 
cesses at addresses within various programmable when a specific event has occurred. The semaphores are 
ranges. It comprises a host mapped memory enable 40 one bit wide registers exclusively used by software 
register, starting address registers, and address compar- When written to, these registers act like any writable 
ators for detection of valid host mapped memory and register. When read by either the host or local CPU, if 
host I/O addresses. The registers are programmable by the semaphore is set to "1", a 'T' returns. If set to 't)" 
both the local and host CPU. Address generator 83 the act of reading the semaphore changes the state of 
computes local memory addresses for host mapped 45 the semaphore to a "1**, and returns a ''O" to the CPU 
memory cycles. It comprises a series of registers for all Thus, only one instruction is required to both test and 
four host mapped memory windows, a 24-bit adder, set the semaphore. 

address latches for buffered write operations and tri- Access to the semaphores is controlled so that only 

State drivers for the internal address bus. one access to a semaphore can occur at any given time 

Bus interface transceivers 87 are transparent latches 50 If two requests do occur neariy simultaneously to the 

for latching 32-bits of bus address when the interface same semaphore, arbitration logic automatically serial- 

unit 30 is in a slave mode. The data transceivers drive izes the requests by giving priority to requests made by 

and receive data, and include data pipeline latches for the host. 

receiving data and data steering logic to support byte/- The present invention has now been described in one 

word/double word transfers. In master mode, the con- 55 specific embodiment. As modifications of this embodi- 

trol transceivers arc driven by the sequencer to generate ment are readUy foreseeable, it is intended that this 

data transfer cycles. Also, several of the returned sig- description and the following claims should encompass 

nals from the selected card are used by the interface unit these changes and modifications, 

to adapt the duration and format of the basic transfer We claim: 

cycle to the selected card characteristics. In slave mode, 60 1. In a multiprocessor computer system having at lest 
the transfer control transceivers are used for input to two processors, each processing having an associated 
the interface unit. The arbitration control and slave memory and each processor being coupled to the other 
return transceivers are not used and the interface unit through an interface unit, the processors also sharing a 
dnves the slave group of signals. resource, interface apparatus for maintaining and signal- 
Register Arbiter 110 arbitrates between register ac- 65 ing an accessibility status of said shared resource, said 
cesses by the two CPUs. All registers arc readable by interface apparatus comprising: 
both CPUs. Write accessibility of the interface unit's a hardware semaphore for said shared resource, the 
registers is grouped in four classes: registers writable semaphore being one bit wide; 



04/29/2004, EAST Version: 1.4.1 



5,276,886 



10 



15 



hardware circuitry means for detecting when one of 
said processors attempts to write a value to the 
semaphore and for forcing the semaphore to said 
written value regardless of any previous value of 
the semaphore; 

hardware circuitry means for detecting when one of 
said processors attempts to read the semaphore, 
and for determining a status of the semaphore; and 

hardware circuitry meansi coupled to said detecting 
and determining means, for returning a zero value 
and simultaneously resetting the semaphore to a 
one if the status is a zero, and for returning a one 
value if the status is a one, wherein a returned value 
of zero indicates that the shared resource is accessi- 
ble. 

2. A method for controlling synchronization between 
processors in a multi-processor system, the system hav- 
ing at least two processors, each processor having an 
associated memory and each processor being coupled to 
the other through an interface unit having a hardware 20 
semaphore means, the method comprising the steps of: 
initiating a first write access with a value of one to the 

semaphore means by one of said processors; 
in response to said first write access, forcing the 
semaphore means to a value of one regardless of 25 
any previous value of the semaphore; 
signalling an event in one of said processors by initiat- 
ing a second write access with a value of zero by 
said processor to the semaphore means; 
in response to said second write access, forcing the 30 

semaphore means to a value of zero; and 
reading a synchronization value from the semaphore 
means by one of said processors to determine if an 
event has occurred, the value of the semaphore 
means remaining a one if said synchronization 35 



value was a one and the value of the semaphore 
means changing to one automatically if said syn- 
chronization value was a zero. 

3. The method of claim 2, wherein said multiproces- 
sor system comprises more than two processors, 
wherein each of said processors access said semaphore 
means to maintain synchronization. 

4. A method for maintaining and signaling accessibil- 
ity status for resources such as input/output devices and 
counters in a multi-processor system, the system having 
at least two processors, each processor having an asso- 
ciated memory and each processor being coupled to the 
other through an interface unit having a hardware 
semaphore means for a shared resource, the method 
comprising the steps of: 

initiating a first write access with a value of one by 
one of said processors to the semaphore means; 

in response to said first write access, forcing the 
semaphore means to a value of one regardless of 
any previous value of the semaphore; 

initiating a second write access with a value of zero 
by one of said processors to the semaphore means; 

in response to said second write access, forcing the 
semaphore means to a value of zero; 

initiating a read access of the semaphore means by 
one of said processors; and 

in response to said read access of the semaphore 
means, returning a status equal to a value of the 
semaphore means and automatically changing the 
value of the semaphore means to a one if the status 
is a zero, the semaphore means value remaining 
unchanged if the status is a one, a returned status 
equal to zero indicating that the resource associ- 
ated with the semaphore means can be accessed. 
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